Evaluating paragraph retrieval for why-QA
نویسنده
چکیده
We implemented a baseline approach to why-question answering based on paragraph retrieval. Our implementation incorporates the QAP ranking algorithm with addition of a number of surface features (cue words and XML markup). With this baseline system, we obtain an accuracy-at-10 of 57.0% with an MRR of 0.31. Both the baseline and the proposed evaluation method are good starting points for the current research and other researchers working on the problem of why-QA. We also experimented with the addition of smart question analysis to our baseline system (answer type and informational value of the subject). This however did not give significant improvement to our baseline. In the near future, we will investigate what other linguistic features can facilitate re-ranking in order to increase accuracy.
منابع مشابه
Evaluating Answer Extraction for Why-QA using RST-annotated Wikipedia texts
In this paper the research focus is on the task of answer extraction for why-questions. As opposed to techniques for factoid QA, finding answers to whyquestions involves exploiting text structure. Therefore, we approach the answer extraction problem as a discourse analysis task, using Rhetorical Structure Theory (RST) as framework. We evaluated this method using a set of why-questions that have...
متن کاملParagraph retrieval for why-question answering Exploiting discourse structure for intelligent paragraph retrieval for why-QA
Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the an...
متن کاملExploiting discourse structure for intelligent paragraph retrieval for why-QA
Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the an...
متن کاملTHUIR at TREC 2004: QA
In this paper, we describe ideas and related experiments of Tsinghua University IR group in TREC 2004 QA track. In this track, our system consists three components: Question analysis, Information retrieval, and Answer extraction. Question analysis component extracts Query Term and answer type. Information retrieval component retrieves candidate documents from index set based on paragraph level ...
متن کاملJU_CSE_TE: System Description QA@CLEF 2010 - ResPubliQA
Abstr act. The article presents the experiments carried out as part of the participation in the Paragraph Selection (PS) Task and Answer Selection (AS) Task of QA@CLEF 2010 – ResPubliQA. Our System use Apache Lucene for document retrieval system. All test documents are indexed using Apache Lucene. Stop words are removed from each question and query words are identified to retrieve the most rele...
متن کامل